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ABSTRACT 

We present a weak-leasing analysis of the merging Frontier Fields (FF) cluster Abell 2744 using 
new Subaru/Suprime-Cam imaging. The wide-field leasing mass distribution reveals this cluster is 
comprised of four distinct substructures. Simultaneously modeling the two-dimensional reduced shear 
field using a combination of a Navarro-Frenk-White (NEW) model for the main core and truncated 
NEW models for the subhalos, we determine their masses and locations. The total mass of the 
system is constrained as M 200 C = (2.06 ± 0.42) x 10^^ Mq. The most massive clump is the southern 
component with M200C = (7.7 ± 3.4) x 10^^ Mq, followed by the western substructure (M200C = 

(4.5 ±2.0) X 10^^ Mq) and two smaller substructures to the northeast (M 200 C = (2.8 ±1.6) x 10^^ Mq) 
and northwest (M 200 C = (1-9 ± 1.2) x 10^^ Mq). The presence of the four substructures supports 
the picture of multiple mergers. Using a composite of hydro dynamical binary simulations we explain 
this complicated system without the need for a “slingshot” effect to produce the northwest X-ray 
interloper, as previously proposed. The locations of the substructures appear to be offset from both 
the gas (87^28 arcsec, 90% GL) and the galaxies ( 72 I 53 arcsec, 90% GL) in the case of the northwestern 
and western subhalos. To confirm or refute these findings, high resolution space-based observations 
extending beyond the current FF limited coverage to the west and northwestern area are essential. 

Subjeet headings: cosmology: observations — dark matter — galaxies: clusters: individual (Abell 
2744) — gravitational lensing: weak 


1. INTRODUCTION 

Glusters of galaxies depict the most recent bound phase 
of the hierarchical structure formation, evolving and re¬ 
laxing by accreting matter along large scale filaments. 
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In some extreme cases, clusters are caught in the act of 
violent head-on collisions between groups or clusters of 
similar masses. A key tool in understanding the physics 
governing such events is the comparison between the dis¬ 
tributions of the three main components involved - the 
dark matter (DM), the hot intracluster gas and the stars 
in galaxies. 

Although it comprises 85% of all the matter in the 
Universe (Hinshaw et al. 2013), the nature of DM is still 
largely unknown. It is thought to be cold and collision- 
less like the galaxies. The intracluster gas, on the other 
hand, is highly dissipative and can, in extreme cases, 
be separated from the DM and galaxies by ram-pressure 
stripping. The best example to date of such stripping is 
seen in the “Bullet” cluster (Markevitch et al. 2002 ), a 
high-velocity merger viewed in the plane of the sky. The 
gas of the bullet, as seen by X-ray, is lagging behind the 
DM component, deduced from weak gravitational lens¬ 
ing (WL) (Clowe et al. 2004). Measuring the displace¬ 
ment of the X-ray gas peaks from the gravitational mass 
peaks in this cluster provided the first upper limit on the 
cross-section for collision of DM, < lcm^g“^ (Marke¬ 
vitch et al. 2004). 

Due to their enormous gravitation potential, clusters 
of galaxies also act as powerful cosmic lenses, enhanc¬ 
ing light coming from galaxies that formed in the early 
universe. Recent discoveries were made of galaxies out 
to redshifts as high as z ^ 11 (Zheng et al. 2012 ; Coe 
et al. 2013), being magnified by massive clusters observed 
as part of CLASH, a multi-cycle treasury program with 
the Hubble Spaee Teleseope (TfSTj(Postman et al. 2012 ). 
This inspired the ongoing Frontier Fields^^ (FF) initia- 

http://www.stsci.edu/hst/campaigns/frontier-fields/ 
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tive (Lotz et al. 2014), an ambitious HST program to 
deeply observe six strong-lens (SL) clusters and detect 
high-redshift galaxies. The legacy FF program will for¬ 
tuitously provide deep, high-resolution imaging of the 
cluster cores. Accurate maps of the mass distribution in 
these clusters are needed in order to quantify the mag¬ 
nification of the distant galaxies and measure the star 
formation history in the early universe. 

Most of the FF clusters are known to be non-relaxed 
systems. One of these clusters, Abell 2744 (hereafter 
A2744; ^ = 0.308) (Lotz 2013), shows one of the most 
complicated merger phenomena to have been detected. 
It was first recognized as a merger by the presence of a 
luminous radio halo and a radio relic (Giovannini et al. 
1999; Govoni et al. 2001a, b). Chandra X-ray studies re¬ 
vealed intricate substructure (Kempner & David 2004; 
Owers et al. 2011), including a prominent hot gas cloud 
situated between the two main galaxy partitions, cold 
and dense remnant cores to the south and north, tidal 
debris between them, and an interloping cloud of gas 
to the north-west, with no associated galaxies. These 
all pointed to a north-south merger with roughly equal 
mass, and a further dynamical study indicated there is a 
large line-of-sight (LOS) component to the merger axis 
(Boschin et al. 2006). However, it did not give a clear 
picture where lies the massive cluster core, to the north 
(Owers et al. 2011) or to the south (Kempner & David 
2004), and whether the northwest X-ray interloper was 
falling in or had already passed from the southeast. A 
recent lensing analysis by (Merten et al. 2011, hereafter 
Mil), mainly using HST and VLT imaging data, showed 
that the main cluster potential is situated near the south¬ 
ern part, but confounded the picture even more by find¬ 
ing a “dark core” of DM-only material between the two 
gas clouds, whereas the X-ray interloper seemed to be 
empty of DM as well as galaxies. Mil suggested a com¬ 
plicated “slingshot” scenario where the gas was thrown 
past the galaxies and DM, to try and explain this unclear 
picture. 

In this paper, we perform an improved wide-held 
WL analysis of the cluster A2744, using new deep 
Subaru/Suprime-Gam observations that cover the full ex¬ 
tent (< 5 Mpc) of this intermediate redshift cluster. Our 
multi-band imaging allows us to obtain a reconstructed 
cluster total mass distribution free of dilution by fore¬ 
ground structures, a main source of systematics in WL 
studies. We aim to detect and accurately constrain the 
physical properties of the different substructures of this 
merging cluster, so that a clear picture of the merger sce¬ 
nario can be illustrated by custom-made hydro dynamical 
simulations. 

This paper is organized as follows. In Section 2 we 
briehy summarize the basic theory of WL. In Section 3 
we present the observation, their reduction and process¬ 
ing. In Section 4 we present the WL analysis, includ¬ 
ing the mass reconstruction, substructure detection and 
multi-halo modeling of their masses. In Section 5 we 
discuss our results, compare with earlier studies and re¬ 
visit the interpretation of the merger scenario in light 
of our hndings. We hnally summarize in Section 6. 
Throughout this paper, we use the AB magnitude sys¬ 
tem, and adopt a concordance ACDM cosmology with 
Qrn = 0-3, = 0.7, and Hq = 100 hkms“^ Mpc“^ 

with h = 0 . 7/170 = 0.7. In this cosmology, V corre¬ 


sponds to 272 kpc at the cluster redshift, 2 : = 0.308. The 
cluster center used is R.A.=00:14:18.9, Decl.= —30:23:22 
(J2000.0) (Ebeling et al. 2010), which closely corresponds 
to the cluster X-ray brightness peak. 


2. WL METHODOLOGY 

In this work, we infer from lensing the surface mass 
density, a.k.a. convergence, = X( 0 )/Xcrit, which 

is expressed in units of the critical surface-mass density 
for lensing, Sent = {c^Ds)/(A'kGDiDis) = /{AirGDip), 

where P{zs^z\) = D\^/D^ is the lensing “depth” for a 
source at redshift 2 : 3 , and Di, Dg, and D\^ are the lens, 
source, and lens-source angular diameter distances, re¬ 
spectively. 

The shear field induced by gravitational lensing, 7 = 
71+^725 is non-locally related to the convergence through 


7(6») = 1 y (To'Die - e')K{e') (i) 

with the kernel being D{0) = {$2 — 0^ — 2i0i02)/\0\^ 
(Kaiser & Squires 1993). The convolution theorem yields 
in turn the convergence, 

k( 6») = 1 y <ro'D*{e - d')'i{e') (2) 

In the weak regime, the reduced shear (see, e.g., Bartel- 
mann & Schneider 2001), 

g{d)^^{d)/{l-K{d)\ (3) 


is the actual observable, derived from the observed ellip- 
ticities of background galaxies (Kaiser et al. 1995). We 
calculate the weighted average of reduced shear on a pix- 
elized Cartesian grid (m = 1 , 2 ,..., A/pix) as 




Ei S{0i,0m)Wi 


( 4 ) 


where S{0i^0m) is a spatial window function, gi is the 
estimate for the reduced shear of the ith object at 0 ^, 
and Wi is the statistical weight for the ith object, given 
by 


where is the error variance of gi^ and a‘^ is a softening 
constant variance. We set a = 0.4, which is a typical 
value of the mean RMS ag found in Subaru observations 
(e.g., Umetsu & Broadhurst 2008; Okabe et al. 2010). 
The variance on the pixelized shear map is then given by 
Umetsu et al. (2009, 2015) 


O-liOm) 


T,i 


(6) 


3. SUBARU OBSERVATIONS 

In this section we present the data reduction and anal¬ 
ysis of A2744 based on deep multi-color imaging (Sec¬ 
tion 3.1). We briefly describe our WL shape measure¬ 
ment procedure in Section 3.2. The galaxy sample selec¬ 
tion is described in Section 3.3 and the samples depth 
estimation is given in Section 3.4. 
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Figure 1. Left: 26' x 26' Subaru BRqz' false color image of A2744 {z = 0.308). We overlay the surface mass density map reconstructed 
from the Subaru WL analysis (white, starting at 1.5(7/^, in Scr^c increments), and the luminosity density map (yellow dashed). North is up 
and east is to the left. Right: A zoomed image on the cluster inner 3' X 3'. Overlaid here are the surface mass density (white, starting at 
3(7^:, in 2aK increments) and X-ray brightness contours (red, 2.5 — 9crx in logarithmic spacing) from Chandra^ depicting the gas density. 
Notably, there is no WL mass overdensity at the location of the X-ray interloper, but there is one located eastern of it. Additionally, the 
second most massive western substructure detected in our WL map is not traced by the X-ray map. 


Table 1 

Optical Imaging Data 


Instrument 

Filter 

Exposure time 
(ks) 

Seeing 

(arcsec) 

mum^ 
(AB mag) 

Subaru/S-cam 

B 

2.1 

1.08 

27.58 

ESO/WFI 

V 

2.7 

0.9 

25.59 

Subaru/S-cam 


3.12 

1.16 

26.83 

Subaru/S-cam 

i' 

1.68 

1.42 

26.31 

Subaru/S-cam 

z' 

3.6 

1.07 

26.03 

VLT/HAWK-P 

Ks 

47.52 

0.4 

25.25 


^ Limiting magnitude for a 3cr detection within a 2" aperture. 
^ Band used for WL shape measurements. 

Instrument only covers central 4'. 


3.1. Data reduction and Photometry 

A previous joint SL+WL study of this cluster was pre¬ 
sented in Mil, combining HST/ACS data along with 
wide-field imaging from VLT/VIMOS and the Subaru 
Telescope. However, The Subaru observations were 
taken under poor conditions, and in only one band 
(FWHM^/ ~ 1.5"). Therefore, Mil leasing analysis was 
governed by the SL+WL in the inner 3' x 3', whereas its 
wide-held WL analysis was shallow and limited to the 
central 9' x 9'. 

We analyze here new data obtained with the wide-held 
camera Suprime-Cam (30' x 30' Miyazaki et al. 2002) at 
the prime focus of the 8.3-m Subaru on the nights of 
2013 July 14 & 15, taken in the B^Rq^z' bands. We 
add to these the old data from Subaru %' band, archival 
ESO/WFI^^ V (ESO-843; program 079.A-0805) band, 
and archival VLT/HAWK-I band (program 092.A- 

http://archive.eso.org 


0472), in order to improve the accuracy of the photo¬ 
metric redshifts, used to determine the sample depth 
(Section 3.4). The observation details of A2744 are sum¬ 
marized in Table 1. Figure 1 shows a BRqz' composite 
color image of the cluster central 26' x 26' (left panel), 
produced with the Stiff software (Bertin 2012). We 
overlay it with the total mass density map determined 
from WL (white contours), the luminosity density map 
(yellow contours on the left panel) and the smoothed 
Chandra X-ray luminosity map (red contours on the right 
panel) which we describe in Section 4.2. 

Our reduction pipeline derives from Nonino et al. 
(2009) and SDFRED (Yagi et al. 2002; Ouchi et al. 2004) 
and has been optimized for accurate photometry and 
WL shape measurements. It has been fully described 
in Medezinski et al. (2013). We refer the reader to this 
work for full details and briefly summarize the main steps 
here. Standard reduction steps include bias subtraction, 
flat and super-flat field correction and point-spread func¬ 
tion (PSF) matching between exposures in the same band 
(to some intermediate level). Masking of saturated star 
trails and other artifacts is then applied. We derive an 
astrometric solution with the SCAMP software (Bertin 
2006) using VISTA/VIRCAM z’-band image from the 
ESO archive, which in turn has been tied to 2MASS^^ as 
an external reference catalog. Finally, the SwARP soft¬ 
ware (Bertin et al. 2002) is utilized to stack the single 
exposures on a common WCS grid with pixel-scale of 

This publication makes use of data products from the Two 
Micron All Sky Survey, which is a joint project of the Univer¬ 
sity of Massachusetts and the Infrared Processing and Analysis 
Center/California Institute of Technology, funded by the National 
Aeronautics and Space Administration and the National Science 
Foundation 
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0 . 2 ". 

The photometric zero-points were derived from a suit¬ 
able set of Standard stars. These zero-points were refined 
by fitting spectral energy distribution (SED) templates 
with the BPZ code (Bayesian photometric redshift esti¬ 
mation, Benftez 2000; Benitez et al. 2004) to the colors of 
1311 galaxies having spectroscopic redshifts from Owers 
et al. (2011). This leads to a final photometric accuracy 
of ^ 0.005 — 0.02 mag in the different passbands. The 
6-band BVRci'z'Ks photometry catalog was measured 
using SExtractor (Bertin et al. 2002) in dual-image mode 
on PSE-matched images created by ColorPro (Coe et al. 
2006), where a combination of B V Rc -\- i' -\- z' 
bands was used as a deep detection image. The stellar 
PSEs were measured from 100 stars and modeled using 
IRAE routines. The number density of galaxies in our 
photometric catalog is fig ^ 64arcmin“^. 

3.2. Shape Measurement 

Eor shape measurements, we use our WL analysis 
pipeline which is based on the IMCAT package (Kaiser 
et al. 1995, KSB hereafter), incorporating modifications 
and improvements developed and outlined in Umetsu 
et al. (2010). Our KSB-f implementation has been ap¬ 
plied extensively to Subaru/Suprime-Cam cluster ob¬ 
servations (e.g., Coe et al. 2012; Merten et al. 2015; 
Medezinski et al. 2010, 2011, 2013; Okabe & Umetsu 
2008; Umetsu & Broadhurst 2008; Umetsu et al. 2009, 
2010, 2011b,a, 2012; Zitrin et al. 2011, 2013). Eull de¬ 
tails of our WL analysis pipeline are presented in Umetsu 
et al. (2012) and Umetsu et al. (2014). Here we reit¬ 
erate some of the key aspects of the pipeline for com¬ 
pleteness. Objects are detected using the IMCAT peak 
finding algorithm, HFINDPEAKS, providing a peak posi¬ 
tion, Gaussian scale length, r^, and an estimate of the 
significance of the peak detection, v. Close pairs of ob¬ 
jects detected above n = 1 and having a neighbor within 
3r^ are rejected to avoid shape measurement bias. Then 
all objects with < 10 are excluded from the analysis. 
KSB’s isotropic correction factor, P^, is calibrated as a 
function of object size (r^) and magnitude, using galax¬ 
ies detected with high significance z/ > 30 (Umetsu et al. 
2010), so as to minimize the inherent shear calibration 
bias. Einally, we apply it to obtain the shape estimate 
as Qo, = eo^jPg^ with Cq, being the anisotropy-corrected 
ellipticity. 

In this analysis we use the Pc-band for shape mea¬ 
surement, which was taken in medium seeing conditions, 
but is significantly better compared to the earlier i'-band 
data taken in 2008. We do not PSE-match the single 
exposures before stacking (as done for photometric mea¬ 
surements), to retain the WL information derived from 
the shapes of galaxies. In addition to the v cuts de¬ 
scribed above, we apply the following stringent size cuts 
(Umetsu et al. 2012) when deriving the final shape cat¬ 
alog: Tg > mode(rp and Vh > mode(r^) -h 1.5cr(r^), 
where is the object half-light radius and the subscript 
asterisk denotes quantities for stellar objects. 

We restrict the catalog to the central 26' x 26' region 
to ensure accurate PSE characterization. Based on sim¬ 
ulated Subaru Suprime-Cam images (see Section 3.2 of 
Oguri et al. 2012; Massey et al. 2007), we include in our 
analysis a calibration factor of 1/0.95 as Qi gi/f).^b to 
account for residual calibration. The resulting shape cat- 


Table 2 

Galaxy color selection. 


Sample 

Magnitude limits 
(AB mag) 

^9 

riq 

('-5 

(zs) 


w 

green 

16.3 <z' < 23.5 

1135 

1.1 

0.3 

0.3 

0.07 ±0.07 

red 

21.0 < z' < 25.6 

4149 

5.7 

1.4 

1.2 

0.68 ±0.07 

blue 

22.0 <z' < 26.7 

3777 

5.1 

1.7 

1.2 

0.67 ±0.16 

red+blue 

21.0 <z' < 26.7 

7926 

10.8 

1.5 

1.2 

0.68 ±0.11 


Note. — Column (1): samples used in this analysis; the red 
and blue samples statistics are derived after matching with the shape 
catalog. Column (2): Magnitude limits for the galaxy sample. Col¬ 
umn (3): Number of galaxies. Column (4): Mean surface number 
density of galaxies. Column (5): Mean photometric redshift of the 
sample obtained with the BPZ code. Column (6): Effective source 
redshift corresponding to the mean depth {/3} of the sample. Col¬ 
umn (7): Distance ratio averaged over the redshift distribution of the 
sample. 

alog has a number density of galaxies, fig ^ 25 arcrnin”^, 
including cluster members and foreground galaxies. 

3.3. Sample Seleetion 

Here we describe our selection of cluster and back¬ 
ground galaxy samples, trying to maximize the number 
of usable galaxies for our WL analysis while avoiding di¬ 
lution of the WL signal by cluster and foreground mem¬ 
bers. We use the P, Pc, z' Subaru imaging which spans 
the full optical wavelength range to perform color-color 
(CC) selection, as established in Medezinski et al. (2010). 

3.3.1. Cluster Sample 

We plot the B — Rq vs Rq — z' distribution of all galax¬ 
ies from our photometric sample to the limiting magni¬ 
tude (Eigure 2, cyan). We identify the cluster-dominated 
area in this CC-space by separately plotting the number 
density in P — Rq vs. Rq — z' using only galaxies having 
small projected cluster-centric radii, 0 < 2' { ^ 0.4 Mpc 
at z\ = 0.308). We specify a region above some charac¬ 
teristic overdensity in this space (shown as a solid green 
curve in Eigure 2). Then, all galaxies within this distinc¬ 
tive region from the full CC diagram define the “green” 
sample (Eigure 2, green points), comprising mostly the 
red-sequence of the cluster and a blue trail of later-type 
cluster members. This is also demonstrated by overlay¬ 
ing spectroscopically-selected cluster members (Eigure 2, 
black points), which nicely coincides with the location of 
the green sample where the red-sequence lies. 

The WL signal for the “green” population is consis¬ 
tent with zero at all radii (Eigure 4, green crosses), 
confirming it contains no background. Eor this popu¬ 
lation of galaxies, we find a mean photometric redshift 
of {zphot) ~ 0.305 (see Section 3.4), consistent with the 
cluster redshift. Importantly, the green sample marks 
a region that contains a majority of unlensed galaxies, 
relative to which we select our background samples, as 
summarized below. 

3.3.2. Baekground Sample 

It is critical to avoid inclusion of unlensed cluster mem¬ 
bers and foreground galaxies in the background sample 
used for the WL analysis, so as not to dilute the true 
lensing signal (Broadhurst et al. 2005; Medezinski et al. 
2007, 2010). This dilution effect reduces the strength of 
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Figure 2. B — Rq vs. Rq — z' color-color diagram of all the 
galaxies in the Subaru imaging (cyan). “Blue” and “red” back¬ 
ground galaxies (lower-left blue and lower-right red, respectively) 
are selected for WL mass analysis. The “green” sample (green 
solid contour), defined by an overdense region of small cluster¬ 
centric radii galaxies, comprises mostly red sequence and some blue 
later type cluster members. The background samples are well iso¬ 
lated from the green region and satisfy other criteria as discussed 
in Section 3.3.2. Our background selection successfully excludes 
all spectroscopically confirmed cluster members (black; though see 
text for details). 


I red BG 
I blue BG 
I green CL 


0 0.5 1 1.5 2 2.5 3 3.5 4 


Figure 3. Normalized redshift distributions of green, red and 
blue samples defined by the CC-selection. Redshift estimates are 
derived from BPZ photo- 2 ;’s. The cluster redshift is marked with 
a black line. 


all, our CC-selection criteria provides Ng = 1135,17141, 
and 16281 galaxies, in the green, red, and blue photom¬ 
etry samples, respectively. For our WL distortion analy¬ 
sis, we have a subset of 4149 and 3777 galaxies in the red 
and blue samples (with usable Rq shape measurements), 
respectively, or a total background number density of 
fig = 10.8 arcmin“^. 


the leasing signal (by a factor of 2-5 ai R ^ 400kpch“^; 
see Figure 1 of Broadhurst et al. 2005), particularly at 
small radii where the cluster potential is largest, in pro¬ 
portion to the fraction of unlensed galaxies whose orien¬ 
tations are randomly distributed. 

We use the CC selection method of Medezinski et al. 
(2010) to define undiluted samples of background galax¬ 
ies, (for details, see Medezinski et al. 2010; Umetsu et al. 
2010), which has been successfully applied to many clus¬ 
ter WL analyses (Medezinski et al. 2010, 2011, 2013; 
Merten et al. 2015; Umetsu et al. 2010, 2011b, 2012, 
2014, 2015). We make use of the (B — Rq) vs. {Rq — z') 
CC-diagram to encompass the red and blue branches of 
galaxies, of the WL signal is visible, to minimize con¬ 
tamination by unlensed cluster members and foreground 
galaxies. The color boundaries of our “blue” and “red” 
background samples are shown in Figure 2. The magni¬ 
tude limits of all the samples are given in Table 2. For 
both the blue and red samples, we find a consistent tan¬ 
gential shear profile (see Section 4.1), rising all the way 
to the cluster center, as expected (see Figure 4). Fur¬ 
thermore, none of the spectroscopically-selected cluster 
members (black points in Figure 2) are present in the 
blue or red samples, confirming the purity of our sam¬ 
ple. The two black points that only appear to be within 
the color boundaries of the blue background sample are 
in fact not included in it, as the faintest spec-z member 
(z' = 21.8) is brighter than the bright blue magnitude 
limit {z' > 22). Finally, The percentage of galaxies in the 
background sample that have photo-z’s (see Section 3.4) 
smaller than the cluster redshift (all of which are in the 
blue sample) is small, just 1.6% of the background sam¬ 
ple, demonstrating a negligible contamination level. 

Full details of our samples are listed in Table 2. Over- 


3.4. Depth Estimation 

We use the BPZ code to measure photometric redshifts 
(photo- 2 ;s) Zphot from our deep Subaru+WFI+HAWK-I 
BVRci'z'Ks photometry (Section 3.1, though note 
only covers the central 4'). We present the redshift dis¬ 
tributions of the red, blue and green samples in Fig¬ 
ure 3. As can be seen, the green sample lies at about 
the cluster redshift, z\ = 0.308, whereas the red and the 
blue samples lie mostly at higher redshifts, z^hot ^0.5. 
According to the photo- 2 ;’s of our background sample 
(Section 3.3.2), the level of foreground contamination 
is negligible (< 2%). We emphasize, that although the 
redshift distribution of our CC-selected sample is good 
enough to demonstrate the reliability of our selection of 
background-only galaxies, the reverse is not true - using 
photo-z’s for background selection that are based on only 
^ 5 optical bands is not sufficiently unbiased, as photo¬ 
n’s are highly degenerate, especially for blue galaxies that 
have relatively fiat SEDs. We, on the other hand, us¬ 
ing CC-selection, can select those faint blue background 
galaxies that lie at high redshifts, nphot = 2 — 3 (bottom- 
left corner of Figure 2). 

To convert to physical mass units, we need to esti¬ 
mate the mean depths ((/d), see Section 2) of the back¬ 
ground samples used in our WL analysis. We follow the 
prescription devised in Umetsu et al. (2014), and ex¬ 
clude galaxies above nphot >2.5 and having ODDS< 0.8 
(as given by the BPZ code). In Table 2 we summa¬ 
rize the mean depths {/3) and the effective source red- 
shifts ^s,eff for orir background samples. The mean depth 
for the combined blue and red sample of background 
galaxies is (/3back) = 0.68 ± 0.11, which corresponds to 

^s,eff = 1-2 ± 0.1. 
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Figure 4. Binned tangential reduced-shear (upper panel) and 
the 45° rotated (x) component gx (lower panel) as a function of 
cluster radius, for our red (triangles), blue (circles), green (crosses) 
and blue+red (squares) galaxy samples, horizontally shifted for 
visual clarity. The x-component is consistent with a null signal 
detection to within Icr, except in the strong regime, 9^2', where 
the cluster morphology is complicated by substructure. 

4. WL ANALYSIS 

4.1. Tangential Distortion Analysis 

The tangential component of the reduced-shear, is 

used to obtain the azimuthally-averaged distortion due 
to lensing: 

=-(fli cos 20 + ^2 sin 20), (7) 

where 0 is the position angle of an object with respect to 
the cluster center, and 5 ^ 1 , ^2 are the Cartesian distortion 
coefficients. 

In Figure 4 we plot the radial profile for the green, 
red and blue samples. The black points represent the 
red+blue combined sample. The shear profiles obtained 
from both the red and blue sample rise toward the cen¬ 
ter of the cluster, and agree with each other within the 
errors, demonstrating that both these samples are dom¬ 
inated by background galaxies. It is only below 0^2' 
that we find some disagreement, likely due to the inner 
cluster substructure. The profile of the green sample 
agrees with zero at all radii, supporting the interpreta¬ 
tion that it consists of mostly cluster members. 

4.2. Total Mass, Gas and Light Distributions 

To recover the underlying projected mass density field 
S(0), we use the Kaiser & Squires (1993) linear in¬ 
version method (also see Umetsu et al. 2009, Section 
4.4). We first pixelize the distortion measurements of 
the background (blue+red) sample (Section 3.3.2) us¬ 
ing Equation (4) onto a 200 x 200 regular grid over 
the central 26' x 26' region, with a Gaussian kernel, 
S{0) (X exp[—6>^/26>g], where 0^ = YWHM/ ln2 is the 
smoothing scale. We set FWHM = 1', chosen to opti¬ 
mally balance the number density of background galaxies 
available for the WL reconstruction, and to resolve the 
adjacent substructures in the core of the cluster, sepa¬ 
rated by an order of ^ 2'. We then invert the distortion 
map using Equation (2) to obtain the lensing convergence 
field, n{9), shown in Figure 5 (top left panel). 


Table 3 

Detected Substructures. 


Halo 

Yc 

(arcmin) 

Yc 

(arcmin) 

zi 


S/N 

Core 

-0.33 + 0.18 

-0.72 + 0.16 

0.32 + 0.01 

1.27 + 0.08 

12.1 

W 

1.36 + 0.42 

-0.59 + 0.43 

0.30 + 0.04 

1.27 + 0.07 

7.9 

NE 

-1.62 + 0.33 

0.85 + 0.53 

0.30 + 0.02 

1.27 + 0.08 

4.7 

NW 

0.98 + 0.37 

1.89 + 0.48 

0.30 + 0.01 

1.27 + 0.08 

7.0 


Note. — Column (2-3): The centroid positions of each halo, 
given relative to the cluster center in units of arcminutes. Errors 
are derived as the centroid standard deviation in the bootstrapped 
maps. Column (4): lens redshift, determined from “green” galaxies 
within 1' of the peak location. Column (5): effective source redshift, 
estimated from the background sample. Column (6): Signal-to-noise 
ratio determined from the convergence map. 

We compare the mass density map with both a 2D 
galaxy number-density map (Figure 5, top right panel) 
and a LC-corrected i?c-band luminosity density map (Fig¬ 
ure 5, bottom left panel) of the cluster members (utiliz¬ 
ing the green sample). Both maps are smoothed with a 
Gaussian kernel of the same scale as the mass map above. 
We also overlay the surface mass density map starting at 
3(7/^, in 2(7/^ increments (white contours), to illustrate the 
correlation between these. 

Finally, we utilize both Chandra and XMM-Newton 
public X-ray observations to examine how the gas is dis¬ 
tributed in the core and on larger scales. We acquire 
public XMM-Newton images from the XSA archive (PI 
Bohringer; totaling 11,587 sec). We perform point-source 
removal and interpolation on the MOSl+MOS2+pn 
stacked image, and smooth the image to FWHM= 0.667'. 
We present the map of X-ray emission on large scales 
from the XMM observations in Figure 5 (bottom right), 
along with the mass density contours (white). Chandra 
provides high-resolution imaging of the gas in the core 
of the cluster (3' x 3'). We overlay the X-ray bright¬ 
ness contours from the Chandra stacked image using the 
dataset as processed by Mil in Figure 1 (right panel, 
red contours), and also in Figure 6, starting at a higher 
level in order to demonstrate the fine details of the gas 
substructure. The main diffuse gas cloud is seen to he be¬ 
tween the cluster galaxy components, whereas an X-ray 
interloper is detected in the NW direction but beyond 
the BGGs in that location. The cool core (Tx = 7.7 keV) 
discussed in Owers et al. (2011) is detected in the bottom 
(south) of the main gas cloud. We will further discuss 
the correlations between the gas, stars and DM in Sec¬ 
tion 5.3, but first we need to model the different cluster 
DM components, which we present in Section 4.4. 

4.3. Suhstrueture Deteetion 

We are able to detect substructures in our convergence 
map down to spatial separations of ^ 1', which is the 
smoothing scale of our map. We apply a detection al¬ 
gorithm on our mass map, finding local maxima above 
4.5(7;.^, with separations of > 1'. We find four individual 
peaks with significance > These are labeled as 

Gore, W, NE, NW in the convergence map (Figure 5, 
top left, black pluses), detected at significance levels of 
12.1(7/^, 7.9(7/^, 4.7(7/^ and 7.0(7/^, respectively. These are 
also labeled in Figure 1 (right panel), and marked in 
Figure 6 (green pluses). We summarize the details of the 
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Figure 5. Top Left: The dimensionless surface-mass density distribution, or the lensing convergence k{6), reconstructed from Subaru 
distortion data. Black pluses denote the four density substructures comprising the cluster, detected at a significance level > Top 

Right: Surface number density distribution of cluster member galaxies. Also overlaid are the white contours of the surface-mass density 
map above Scr^^, in 2a^ increments. Bottom Left: Luminosity density distribution of cluster members, overlaid with the same surface-mass 
density contours (white). Bottom Right: X-ray brightness distribution from XMM-Newton, overlaid with the same surface-mass density 
contours (white). 


detected subhalos in Table 3. 

As discussed in Section 3.3, WL analyses may be sub¬ 
ject to several sources of systematics. Two main sources 
are the contamination of the background sample by fore¬ 
ground galaxies and the systematic errors from galaxy 
shape measurements. As presented, we have attempted 
to address the first by a careful selection of the source 
sample according to the CC-selection method (see Sec¬ 
tion 3.3.2). As for the latter, we have excluded from our 
shape catalog noisy measurements using several cuts (see 
Section 3.2). Here we investigate how these two differ¬ 
ent systematics affect the resulting mass reconstruction 
map, and subsequently, the detected substructures. 

Following Mil, we construct a Subaru shape catalog 
where we apply only the > mode(r^) + cut, which 
we dub the biased shape catalog. Additionally, we per¬ 
form a background selection based on photometric red- 
shifts, requiring that 2;phot > 0-5, as in Mil. We explore 


all three combinations of these two variants, and con¬ 
struct a convergence map for each of the following: (1) 
our constrained shape catalog, but with photo-^ selection 
applied; (2) the biased shape catalog, but with our CC- 
selection applied; and (3) the biased shape catalog, with 
photo-z selection applied. The latter is the most simi¬ 
lar to the sample used in Mil’s analysis, and in most 
WL analyses in the literature. We display the results in 
Figure 7, where we compare with our original analysis 
in the top left panel, case (1) presented in the top right 
panel, case (2) in the bottom left panel, and case (3) in 
the bottom right panel. 

Both these systematic effects change the resulting 
WL map significantly for the less massive substructures, 
whereas the main peak remains most significant and ap¬ 
proximately in the same location. The most affected by 
these are the locations of the W and NW clumps, which 
appear closer to their location in Mil’s analysis in case 





































Figure 6. As in Figure 1, color image of the cluster center 
(3' X 3'), showing the 3a^ surface mass density contour (white), 
X-ray brightness contours (red, in exponential scale in order to en¬ 
hance substructure) from Chandra, and the 90% confidence bounds 
on the centers of the detected substructures, as derived from 500 
bootstrapped convergence maps. The four substructures detected 
in the full constrained convergence map are marked by green pluses. 
We also mark the location of the four peaks found by Mil (cyan 
squares). 

(3) which is the closest to Mil’s WL selection. In both 
cases where the photo- 2 ; selection is applied (cases (1) 
and (3)) the WL map suffers from significant dilution 
compared to the CC selection cases, making the core less 
significant. 

Since background selection has such a large effect on 
the detected substructures’ locations, we attempt to ac¬ 
count for this systematic uncertainty by performing a 
bootstrap analysis using a wider source sample than our 
reference constrained CC-selected sample. For this, we 
join the above two samples, the photo-z selected (3) and 
CC-selected sample (2) from the biased shape catalog, 
together containing 18,068 galaxies, or a source density 
of fig = 24.6 arcrnin”^. We draw from this joint sam¬ 
ple 500 bootstrapped convergence maps, and apply the 
same detection algorithm as above. To associate them, 
we match the scattered peaks with the four original peaks 
within a matching radius of 3 times the smoothing PSF, 
3 X Og (see Section 4.2). For each substructure, we then 
calculate 90% confidence bounds, which we overlay in 
Figure 6 (green contours). The Core, our most signifi¬ 
cant peak, is detected in 98% of the bootstrap realiza¬ 
tions, and out of those it is the most significant in 91%, 
making it very robust. The W substructure is ranked 
the second-most massive or most massive in 53% of the 
realizations in which it is detected, while the NW and 
NE are ranked as 2nd or 1st in 48% and 17%, respec¬ 
tively. This demonstrates the ranks of the substructures 
significance are quite robust. 

We estimate the redshift of each component as the 
median photometric redshift of “green” galaxies (CC- 
selected to represent the cluster population; see Sec¬ 


tion 3.3.1) that he within 1' from the respective sub¬ 
halo peak. This substructure lens redshift is used to esti¬ 
mate the respective effective source redshift, 2^8,eff- These 
quantities are listed in Table 3 (columns 4-5). Accord¬ 
ing to the peak redshifts, all the substructures are part 
of the cluster, at z\ ^ 0.31 ±0.1. 


4.4. Multi-Halo Mass Modeling 

To estimate individual masses of the structures com¬ 
prising the cluster, we conduct a two dimensional (2D) 
shear fitting (Okabe et al. 2011; Watanabe et al. 2011), si¬ 
multaneously modeling the multiple halos detected in the 
previous section. The pixelized distortion map {g{9m)) 
(Equation (4)) and its variance map (Equation (6)) 
are constructed by bin averaging (S' = 1, whereas in the 
map-making we used Gaussian smoothing kernel) in each 
cell using the statistical weight Wi (Equation (5)) onto a 
208 X 208 grid of independent cells. 

The log-likelihood function of the shear fitting is cal¬ 
culated as (Umetsu et al. 2015): 

^ ^pix 2 

In/i^ = — — ^ ^ ^ ^ ~ 9a,m] {^g [9a,m ~ 9a,m] 

m a=l 

where ga,m is the theoretical expectation for ga,m = 
9a{9m)^ and Cg^rn is the shear covariance matrix, cal¬ 
culated from the 2D pixelized shear (see Equation (6)) 

as Cg^jYi = 2^Q,g^g,a{9fn^O’g^g(^0rfji^. 

To describe the mass distribution, we employ the fol¬ 
lowing models of halo mass density: 


1. The universal Navarro et al. (1996, NEW) mass 
density profile, given by the form 

PNFw{r) = —-—(9) 
{r/rs)[l^r/rsH 

where ps is the characteristic density, and is the 
characteristic scale radius at which the logarith¬ 
mic density slope is isothermal. The halo mass 
Ma is given by integrating the NEW profile (Equa¬ 
tion (9)) out to the radius ta, at which the mean 
density is A x ficrit(^i): the critical mass density of 
the universe at the cluster redshift, expressed as 
Ma = M(< ta) = (47 r/3)pcrit(^i)Ar^. We use 
A = 200 to define the halo mass, M 2 ooc- The de¬ 
gree of concentration is defined as, C 2 ooc = F 200 c/fs, 
and the characteristic density is then given by 
Ps= Ax pcrit/^ X C^/[ln(l + Ca) - Ca /(1 + Ca)]. 


2. The Baltz et al. (2009) model (aka truncated NEW, 
or tNEW) is a modification of the NEW model, 
which suppresses the NEW mass profile beyond a 
finite radius, r^. This model should better describe 
merging subhalos, which are not determined by the 
virial theorem but rather by the strong tidal force 
of the main cluster. It is expressed as (considering 
the n = 2 case) 


ptNFw{r) = 


ps 


{r/rs){l + r/rs 


1 2 


(1 + r/rt)2 


( 10 ) 


with Tt = T 200 C xr 2 ooc 5 where T 2 ooc is the ratio of the 
truncation to the r2ooc radius. The characteristic 
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Figure 7. Convergence maps for different shape catalogs and/or background selections. In each case, the black pluses mark the substructure 
peaks detected at above > 4.5(7^. Top left: using the constrained shape catalog with CC selection applied, the same as in our analysis. Top 
right: using the constrained shape catalog with photo-z selection applied. Bottom left: using the biased shape catalog with CC selection 
applied. Bottom right: using the biased shape catalog with photo-z selection, the most like Mil’s WL analysis. 


density, ps, is defined as in the NFW profile. Note, 
the virial mass we derive from this fit, M 200 C: is 
not strictly the 3D halo mass, as in the case of an 
NFW profile. We therefore infer the 3D mass inside 
r200c, Mbmo{< r200c), from the fitted parameter, 
using Equation (10) in Oguri & Hamana (2011). 

We consider two scenarios when modeling this cluster 
and its substructure: (a) a single mass halo given by 
the NFW model, not accounting for substructure in this 
case but modeling the cluster as a whole; and (b) four 
mass halos, with an NFW model for the Core (the most 
significant peak) and three tNFW models for the other 
substructures, W, NE and NW. 

Eor each case, we let different sets of parameters vary: 
for the 1-halo solution, (a), we let M 200 C and C 2 ooc of 
the mass profile vary, and also let the center of mass 
vary (Xc, Yc) vary, thus having four free parameters to 
fit for. Eor the sake of computational efficiency, we set a 
fiat prior on the mass and concentration in the range 
0 ^ Af^ 200 c/ 10^^-^0 ^ 4, 0 ^ C 200 C ^ 10 - We test 
several upper limits to confirm that the choice of pri¬ 


ors on the mass and concentration is uninformative. In 
order to be inclusive, yet not to confuse with nearby sub¬ 
structures, we impose a Gaussian prior on the peak cen¬ 
troid. We conservatively set the prior width to three 
times the scatter on the peak location estimated from 
the bootstrapped convergence maps (see Section 4.3), 
C^7r(Xc) = 0-54',Cr^(yc) = 0 - 48 '. 

Eor the 4-halo solution, (b), we set the concentration 
parameter for the main NEW halo and the three tNEW 
subhalos based on the known mass-concentration rela¬ 
tion taken from Bhattacharya et al. (2013). We have run 
several test to validate this assumption (see discussion 
below). Here, since our data are not sensitive enough 
to fit for the centroids simultaneously, we fix the cen¬ 
troids of the halos to the values derived from the con¬ 
vergence map, and set the errors on their location ac¬ 
cording to the scatters of the bootstrapped maps (see 
Table 3). We thus only fit for the halos masses, having 
four free parameters in total. We set the same flat pri¬ 
ors on the masses as above, 0 ^ M 200 C ^ 4 x 10^^ Mq. 
The truncation parameter, r2ooc: of fho three tNEW sub- 
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Table 4 

Best-Fit NFW Multi-Halo Parameters of the 2D Shear Analysis 


Halo 

w 

(aremin) 

Tc 

(aremin) 

R200C 

(Mpe) 

M 200 C 

(IQlSMo) 

ABIC 

\nE 

l-halo model (NFW) 






(1) Core 

0.04 ±0.18 

-0.41 ±0.16 

2.35 ±0.16 

2.06 ± 0.42 

14 

-6044 

4-halo model (NFW+3tNFW, T200 c = 1) 





(1) Core 

(2) W 

(3) NE 

(4) NW 
Total 

-0.32 ±0.18 
1.37 ±0.42 
-1.62 ±0.33 
0.98 ±0.37 

-0.71 ±0.16 
-0.58 ±0.43 
0.85 ±0.53 
1.89 ±0.48 

1.69 ±0.26 
1.42 ±0.22 
1.21 ±0.24 
1.06 ±0.23 

0.77 ±0.34 
0.45 ± 0.20 
0.28 ±0.16 
0.19 ±0.12 
1.76 ±0.23 

0 

-6034 


Note. — An NFW model was fitted to the main Core, and three tNFW to the 
others. The concentration parameter was fitted only for the case of 1-halo model, 
whereas in the 4-halo case the concentrations were set using the mass-concentration 
relation given by Bhattacharya et al. (2013). Where values are derived from the 
posterior distribution of the MCMC sampling, bi-weight center and scale estimators 
are reported. Columns (2-3): The centroid positions of each halo, given relative to the 
cluster center in units of arcminutes. The centroids are derived from peak detections in 
the convergence map, and the errors are derived from the scatter of the peaks detected 
and matched in 500 bootstrapped maps. Column (4): Halo size. Column (5): Halo 
mass. Column (6): Relative Bayesian information criteria (BIC) reported for each 
model. The ratio is given relative to our best model, 4-halo with T200 c = 1 which has 
the lowest BIC. Column (7): Bayesian evidence reported for each model. Evidence is 
calculated as the harmonic mean of the likelihood distribution. 


halos depends on the dynamical timescale, the relative 
strength of the parent to subhalo masses, the distance 
between the parent and subhalo, etc. It should, in prin¬ 
ciple, be left as a free parameter. However, given the 
complexity of this system and the limitations set by our 
data, we only explore two values, r 2 ooc = 1,3. The case 
of i" 200 c = 3 is typical for isolated cluster-sized halos, 
whereas r 2 ooc = 1 is typical for mildly truncated inter¬ 
acting halos (Takada & Jain 2003). The latter provides 
a more realistic value in describing the merging subhalos 
detected in our work (see Section 5). The marginalized 
parameters and confidence bounds are determined us¬ 
ing a Markov chain Monte Carlo (MCMC) method with 
standard Metropolis-Hastings sampling. We use the bi¬ 
weight mean and scale estimators of Beers et al. (1990) to 
characterize the marginalized one-dimensional posterior 
distributions. 

For each model we calculate the Bayesian informa¬ 
tion criteria (BIC) (Schwarz 1978), according to BIC = 
—2hiL-\-k In A/", where L is the maximum likelihood es¬ 
timator, k is the number of free parameters, and N is 
the number of constraints. We also quote the Bayesian 
evidence, calculated as the harmonic mean of the like¬ 
lihood distribution. Both these quantifiers consistently 
show strong indication that the 4-halo models are pre¬ 
ferred over the 1-halo model. Both 4-halo models of the 
two truncation cases, r 2 ooc = 1,3, gave virtually consis¬ 
tent masses. However, the r 2 ooc = 1 is marginally pre¬ 
ferred over the r 2 ooc = 3 case, according to their relative 
BIC, BICr2ooc=3 “ BICr 2 ooc=i = 2. Thus, wc concludc 
that we are not sensitive enough to fully constrain the 
tidal truncation radius of the individual merging subha¬ 
los but there is weak preference for the r 2 ooc = 1 case. 
In what follows, we therefore focus on the r 2 ooc = 1 re¬ 
sults. The marginalized parameter constraints of the 1- 
halo model and 4-halo (r 2 ooc = 1) model are summarized 


in Table 4. The two-dimensional marginalized posterior 
distributions for all pairs of parameters of the 1-halo and 
4-halo cases are shown in Figure 8 and Figure 9, respec¬ 
tively. 

Not shown in the table, for the 1-halo model we find 
the concentration parameter to be C2ooc = 3.5 ±0.8. This 
value is consistent with cluster-type halos of this size 
(Bhattacharya et al. 2013). For the 4-halo solution, we 
have run several tests letting the concentration parame¬ 
ter vary as well, in order to test the effect of fixing the 
mass-concentration ratio in our modeling. The masses 
remained nearly the same, fully consistent within the 
errors, whereas the concentration was not constrained 
very well (giving errors of Ac 2 ooc ^ 3, well beyond the 
range of values between models), and the overall confi¬ 
dence of the model was lower according to ABIC = 34 
(w.r.t. fixed concentrations). These low-mass subhalos 
have scale radii smaller than our resolution scale, so that 
one cannot constrain the concentrations well. 

We compare the results of the 1-halo and 4-halo model 
posterior shear profiles with the observed shear profile 
from Section 4.1 in Figure 10. Overall, the 4-halo model 
indeed describes the observed shear profile better, though 
it has a broad confidence bounds inside 0 < 2' ^ likely due 
to the contribution of the substructures near the cluster 
core. We further discuss our results below in Section 5. 

5. DISCUSSION 

In the previous section, we reconstructed the mass dis¬ 
tribution of the cluster, detected four distinct substruc¬ 
tures near its center, and modeled their masses and lo¬ 
cations directly from the shear data. In the following 
section we discuss our results and their implication on 
the interpretation of the merger. 

5.1. Total Cluster Mass from Different Methods 
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Figure 8. The posterior distribution of the 1-halo model parameters, M 200 C 5 C200c, Yc- The contours show the 68% and 95% confidence 
intervals. The solid lines show the marginalized bi-weight mean. 


A2744 is a very massive cluster, as indicated by all the 
different methods, consistently yielding - 2 X 10^^ M©. 
In the single-halo case, we found a total mass of Mtotai = 
(2.06 ± 0.42) X 10^^ M©, whereas in the 4-halo case, the 
total mass is Mtotai = (1.76 ± 0.23) x 10^^ M©, calcu¬ 
lated from the bi-weight mean posterior of the sum of 
the four halos. For comparison with Mil’s projected to¬ 
tal mass inside 1.3 Mpc, from the 1-halo model we find 
M 2 d(^ < 1.3Mpc) = (1.65 ± 0.23) x 10^^ M©, in very 
good agreement with Mil value, M(r < 1.3 Mpc) = 
(1.8 ±0.4) X 10^^ M©. 


5.2. Masses of the Different Cluster Components 

By utilizing our multi-halo modeling technique we were 
able to estimate the masses of the different components 
of the cluster. Overall, we find that the most massive 
halo is the main Core, with a mass of M 200 C = (0.77 ± 
0.34) X 10^^ M© when fitting 4-halos to the reduced shear 
field. This mass is lower than when fitting just one halo, 
M 200 C = (2.06 ± 0.42) X 10^^ M©, demonstrating that 
in order to best derive the mass of just the Core the 
different substructures need to be taken into account. 
We find the second most massive substructure to be the 
W clump, with a mass of M 200 C = (0.45±0.20) x 10^^ M©. 
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Figure 9. The posterior distribution of the 4-halo model parameters, in the case of T200 c = 1- The contours show the 68% and 95% 
confidence intervals. The solid lines show the marginalized bi-weight mean. 


Our modeling yields a close mass ratio between these two 
components, 1.7:1 (but the uncertainty on this value is 
large: ±1.1). The other two halos have lower masses, 
with M 200 C = (0.28 ± 0.16) x 10^^ Mq for the NE clump 
and M 200 C = (0.19 ±0.12) x 10^^ Mq for the NW clump. 

Mil had detected similarly four substructures. How¬ 
ever, only three - their Core, W and NW peaks - are 
more closely related to our peaks, albeit at slightly off¬ 
set locations (which we further discuss below). Mil 
found projected masses of, M(< 250kpc) = (2.24 ± 
0.55), (1.11 ±0.28), (1.15 ±0.23) x 10^^ M©, for the Core, 
W and NW peaks, respectively. To compare with Mil 


values, we compute the total projected mass within 
250 kpc using the 4-halo modeling results. We obtain 
M(< 250 kpc) = (1.49,1.25,0.76) x 10^^ M©, with typi¬ 
cal errors of ±0.35 x 10^^ M©, for the Core, W and NW 
clumps. Overall, we obtain mostly lower but consistent 
masses - our Core mass is (67 ± 22)% lower, our W is 
(113 ± 42)% higher, and our NW mass is (66 ± 30)% 
lower. 

Most notably. Mil showed the NW to be the second 
most massive clump, with a ^2:1 mass ratio to the main 
Core, whereas in our analysis the W clump is the second 
most massive instead, and our NW is the least massive 
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Figure 10. Azimuthally-averaged radial profiles of the tangential 
reduced-shear of the background galaxy sample (black squares), 
as in Figure 4, with comparison to the 1-halo model posterior shear 
(dashed red line) and its Icr scatter (shaded red region) and the 
4-halo model (in the case of T200 c = 1; solid black line and shaded 
gray region). 


Table 5 

Measured Offsets of DM halos 


Halo 

nearest BCG 
(") 

Mil’s DM halos 
(") 

X-ray feature 
(") 

Core 

W 

NE 

NW 

'79+34 

'^-53 

lelg® 

24ir 

8+28 

Note. 

— Errors 

are estimated as 

90% confidence 


bounds from the bootstrapped map offsets in each case, 
as in Figure 6. Column (2): Offset is given relative to 
BCG in the nearest clump of galaxies, used as a tracer 
of light. Column (3): Offset is given relative to published 
peak locations in Mil. Column (4): Offset is given relative 
to nearest X-ray feature - southern cool remnant core for 
Core, main X-ray emission for the W, and NW interloper 
for NW subhalo. 

clump. However, the determined mass ratios have large 
uncertainties, given our large errors on the masses for the 
satellite substructures. 

5.3. Separations between the Gas, Stars and DM in the 
Substruetures 

The different substructures identified in Section 4.3 
show some remarkable offsets from both the gas and 
galaxies in this complex merger. We summarize these 
offsets in Table 5 and discuss them below. We confirm 
the most massive structure (Core) to be located near 
(16tg^", quoting 90% CL) the designated Core in Mil, 
and north of where Owers et al. (2011) noted an X-ray 
cool southern remnant core (13lg^"). It is also associ¬ 
ated with the BCGs in this location, at about away, 

within the confidence bound (green contour in Figure 6 ). 

As noted above, we find in our analysis the second most 
massive peak to be the W clump, but surprisingly, to be 
separated from any BCG. The nearest BGG is 
further west. We note that the W substructure is highly 


extended southwest, indicating a possible secondary peak 
closer to the BGG. Mil’s W clump is much closer to 
the BGG in that location, and so is 80 ^ 54 " away from 
our W peak. Although the nearest X-ray emission peak 
is 9 lt 2 ?" further east, there is a clear extended X-ray 
emission pointing in its direction (Figure 6 ). 

Mil’s second-most significant peak is located NW 
of the core, associated with the second-most luminous 
clump of galaxies. We, on the other hand, find an 
extended mass distribution (clump NW) with its peak 
69lg3" north of Mil’s NW location, and about 87^28^^ 
east of the X-ray interloper peak. It is also 68 ^ 42 '' north 
of the closest BCG in the northwest clump of galaxies, 
and 581 ^ 4 " west of the closest BCG in the north clump of 
galaxies, so it seems to lie just between these two BCGs. 

Finally, the location of the NE clump is detected close 
south) to where Mil show a low-significance 
clump (not listed in their paper). The X-ray emission 
is extended toward the NE, though no X-ray peak is de¬ 
tected next to the NE substructure. The NE clump has 
the lowest significance = 4.7) of all four peaks, and 
it appears to be extended and bimodal. In fact, the con¬ 
vergence map shows another peak which is part of the 
same extended NE distribution, but at a lower signifi¬ 
cance (cT/^ = 4.48), and was therefore not included in our 
modeling. Although there are no clear BGGs in this lo¬ 
cation, there are several faint cluster members {Rq ~ 23 
mag). 

To demonstrate the different levels of mass-galaxy par¬ 
tition, we investigate the mass-to-light ratio inside a 
small aperture around each mass peak, ^ 250 kpc. We 
use the masses M 2 d(< 250kpc) derived from the 4-halo 
model (see Section 5.2). We estimate the total luminos¬ 
ity inside the same aperture using K-corrected i7c-band 
luminosities of the cluster member galaxies (the “green” 
sample, see Section 3.3.1). We find the projected mass- 
to-light ratios to be: M 2 d(< 250kpc)/I/i^^^(< 250kpc) = 
(103 ± 25, 584 ± 162,366 ± 134,153 ± 63) x M©/L© for 
the Core, W, NE and NW clumps, respectively. In 
comparison, the projected mass-to-light ratio of the en¬ 
tire cluster, calculated using the 1 -halo mass model, is 
^2d(< F200c)/Ai?c(< F200 c) = (241 ±38) X Mq/Lq. 
This value is typical for cluster-sized halos. The value we 
measure for the W peak is about 2.4 ± 0.8 times higher 
than that typical value, and hence reveals that this clump 
is deficient of galaxies. Einally, the Core shows a rather 
low mass-to-light ratio, 0.4 ±0.1 that of the entire clus¬ 
ter. This stems from the contribution of the two BCGs 
which are inside the Core inner 250 kpc. Lower values 
near cluster centers have also been indicated previously 
(Medezinski et al. 2010). 

5.4. Understanding the Merger Seenario 

Multifrequency observations show that A2744 is a 
very complex merging system (see our discussion above). 
Our WL reconstruction suggests that this complicated 
morphology is a result of multiple mergers due to in¬ 
falling subclusters. The details of the merging event in 
A2744, however, are not clear. The complex morphol¬ 
ogy of A2744 can only be explained with dedicated N- 
body/hydrodynamical simulations, which is out of the 
scope of this paper. Here we aim to provide a merely 
qualitative analysis based on a superposition of several 
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Figure 11. Contours of dark matter surface density (black) 
and X-ray emission (red) based on our A/'-body/hydrodynamical 
(FLASH) simulations of merging clusters. The projection angles 
were chosen to produce an image which resembles the observed 
morphology of A2744. See text for details. 

binary mergers from our existing archive of well estab¬ 
lished, self-consistent A/'-body/hydrodynamical simula¬ 
tions, which rely on the publicly available code FLASH 
(Fryxell et al. 2000). Our simulation assumes initially 
spherical clusters with gas following a non-isothermal [3 
model in hydrostatic equilibrium with a truncated NFW 
model for the dark matter. We used a box size of 13.3 
Mpc on a side, and the highest resolution (cell size) was 
of 12.7 kpc. Our simulations were semi-adiabatic in the 
sense that only adiabatic processes and shock heating 
were included. A detailed description of the setup for 
our simulations and a discussion of the input parameters 
can be found in Molnar et al. (2012). 

Different scenarios have been suggested recently for 
this system, although none of them can explain all the 
observed features (Mil; Owers et al. (2011)). These in¬ 
terpretations agree that the dominant X-ray emission is 
a result of a major merger along the north-south axis, 
and that its offset from the DM mass peaks is due to 
merging, but they differ on which component is the mas¬ 
sive cluster core. Owers et al. (2011), based on X-ray 
and galaxy velocity distributions, suggest that the main 
cluster is the northern component (identified as a peak 
in the galaxy surface density distribution to the north 
of the main X-ray peak; see left panel in Figure 13 of 
Owers et al. (2011)), and that the southern component 
(close to the southern X-ray compact cool core) is the 
infalling, less massive bullet subcluster. Based on a com¬ 
bined SL+WL analysis. Mil proposed that the southern 
component, which they find as the more massive one, is 
in fact the main core. Our WL analysis confirms Mil’s 
suggestion that the most massive component of A2744 
is the southern core (see Figure 1 and Figure 6 and Ta¬ 
ble 4). However, our WL analysis suggests that the direc¬ 
tion of the major merger (involving the two most massive 
components - the Core and W) is along the east-west di¬ 


rection, as opposed to Mil’s results which suggest a ma¬ 
jor merging event in the southeast-northwest direction 
(their Core and NWl and NW2 components). 

A likely explanation for our detected NE and W mass 
components is that they have passed through the core 
moving towards east and west, respectively, and lost 
most of their gas due to ram pressure from the more 
massive main halo. This picture explains the X-ray tails 
toward the northeast and southwest seen in the X-ray 
map (Figure 6). In order to explain the X-ray morphol¬ 
ogy extended to the north, we need to assume a south- 
north merger as well. Although the northern mass peak 
we found (see Figure 6) is not very significant (cr^ = 4.0) 
and was therefore not included in our modeling, we in¬ 
clude it here in our scenario, and assume it is much less 
massive than the Core. We label it accordingly as (N), 
to differ it from the other significant halos that were in¬ 
cluded in the mass modeling. If such a subhalo passed 
the Core from the east, moving from the south to the 
north, it can push the gas of the main cluster to the 
northwest. 

It is much more difficult to explain the X-ray interloper 
located to the northwest. Mil suggest that the inter¬ 
loper is a result of a slingshot due to a multiple merger 
event. According to Mil’s scenario, the mass peaks clos¬ 
est to the X-ray interloper (their NWl and NW2), passed 
through the main cluster from the southeast to north¬ 
west (see Figure 9 of Mil). As the subclusters climb out 
of the potential well of the main cluster moving to the 
northwest, both the ram pressure and the gravitational 
potential (which was temporarily enhanced by the NE 
and W subhalos passing through the core) drop, and the 
remaining gas components of NWl and NW2 suffer a 
slingshot effect and get ahead of their DM centers. How¬ 
ever, hydro dynamical simulations (Molnar et al. 2012) 
suggest that at this large distance from the Core of the 
main cluster (^ 600 kpc) the slingshot gas should have 
already fallen back to its DM halo (NW1-1-NW2), making 
the slingshot explanation of Mil unlikely. 

Similar to Mil, we find a significant mass concentra¬ 
tion to the northwest, clump NW, closest to the inter¬ 
loper (^393kpc). Our NW component is offset north of 
Mil’s NW component, but at about the same distance 
from the interloper. Even in this setup, the slingshot ef¬ 
fect is not probable, as the offset between our NW clump 
and the interloper is opposite to the NW-Core axis. Our 
NW component is elongated in the southeast-northwest 
direction, which suggests that it may comprise of two un¬ 
resolved, less massive halos. Under this assumption, we 
suggest an alternative scenario, where the two subclus¬ 
ters NWl and NW2 collide before they reach the main 
cluster. The less massive of the two subclusters is in¬ 
falling from the northwest close to the LOS through the 
larger one, displacing the gas from the potential well of 
the subclusters. As a result of the LOS projection, the 
X-ray peak appears to be located west of the two DM 
peaks. A different scenario to be considered is where the 
interloper gas may be tidally stripped from the W sub¬ 
structure by the northern mass peak (N). However, the 
large physical distance of the X-ray peak from both the 
W and the northern mass components makes this sce¬ 
nario highly unlikely, and in the following we pursue the 
infalling NW scenario to explain the interloper. 

In order to demonstrate the validity of our suggested 
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Table 6 

Simulations of the merging substructures 


Merger 

Mtot 

(1015 Mq) 

Mass Ratio 

P 

(Mpc) 

Vb 

(km s-l) 

W-Core 

1.4 

1.8 

0.2 

4500 

NE-Core 

0.8 

3 

0.15 

4500 

(N)-Core 

1.2 

5.2 

0.15 

4000 

NWI-NW2 

0.23 

6.67 

0.15 

4500 


Note. — Column (2): Total virial mass of the Core and 
satellite halo. Column (3): Mass ratio of the Core to satellite. 
Column (4): Impact parameter. Column (5): Infall velocity. 


scenario, we superimpose four separate binary simula¬ 
tions from our archive, three to explain the main X-ray 
emission and one to explain the NW interloper (see Ta¬ 
ble 6 ). The first simulation is used to represent the east- 
to-west merger, associated with component W, with a 
mass ratio of 1 : 1.8 (close to the observed value), a total 
mass of 1.4 x 10^^ Mq, an impact parameter of 0.2 Mpc 
and an infall velocity of 4500 km s“^. The second simula¬ 
tion represents the west-to-east merger, associated with 
component NE, assuming a mass ratio of 1:3, a total 
mass ofO.SxlO^^ Mq, an impact parameter of 0.15 Mpc 
and an infall velocity of 4500 km s“^. The third sim¬ 
ulation is used to represent the south-to-north merger, 
associated with component (N), assuming a mass ratio 
of 1:5.2, a total mass of 1.2 x 10^^ Mq, an impact param¬ 
eter of 0.15Mpc and an infall velocity of 4000kms“^. 
The fourth merging simulation is used to represent the 
NW minor merger of two smaller subhalos having a to¬ 
tal mass of 0.23 x 10^^ Mq, a mass ratio of 1:6.67, an 
impact parameter of 0.15 Mpc and an infall velocity of 
4500 km s“^. The masses and concentration parameters 
of the Core in our binary simulations lie in the range 
0.6 < Mvir/IO^^MQ < 1.0 and 5 < Cvir < 8 , as allowed 
by our mass modeling. We choose the outputs (epochs) 
and the viewing angles of each simulation to qualitatively 
reproduce the observed surface mass density distribution 
and X-ray morphology. We assumed that the final Core 
component in the different merging simulations is the 
same, but viewed at different epochs, thus represented 
by different “effective masses”. Therefore, when stack¬ 
ing the binary simulations we rescale the mass of the 
Cores so that it would be consistent with the observed 
mass (X]i=i 3 = 0-8 x 10^^ Mq). We used the same 

weights {wi) to rescale the associated Core X-ray emis¬ 
sion accordingly. In Figure 11 we show the contours of 
the DM surface density (black) and X-ray emission (red) 
of all four simulations superimposed, while normalizing 
the mass of the Core. We associate the mass peaks of 
our simulated image with those detected in our WL re¬ 
construction by labeling them Core, NE, W, NW and 
(N). 

The morphology of the main components resembles the 
observed features of A2744. The mass peaks of our simu¬ 
lations coincide with the locations of the observed Core, 
the W component and the non-significant peak (N). The 
extension of the main X-ray emission to the west towards 
the W component and to the north are clearly visible and 
resemble the Chandra observations. Our simulation also 
reproduces the mass peak elongated in the southeast- 
northwest direction similar to our NW component. The 


associated X-ray interloper emission is located just west 
of the NW mass peaks, as observed, although the offset 
in the simulated image is somewhat smaller , ^ 200 kpc. 
However, neither our binary merger nor Mil’s slingshot 
scenario can explain the northern edge of the interloper 
(which is interpreted as a cold front by Owers et al. 
(2011)) and all other observations consistently. More¬ 
over, systematics in the surface mass density reconstruc¬ 
tion (see Section 4.3) leading to large uncertainties in 
the positions of our less massive components make it dif¬ 
ficult to accurately constrain the level of offset between 
the NW mass peak and the X-ray interloper, and thus to 
strongly distinguish between suggested merger scenarios 
of this component. 

6. SUMMARY AND CONCLUSIONS 

We have presented an improved WL analysis of the 
merging cluster A2744, using new deep BRz' imaging 
from Subaru/Suprime-Cam. We reconstruct the total 
mass distribution out to about twice the virial radius of 
the cluster, 0 = 18' (approximately 5 Mpc at ^ = 0.308). 
The deep multi-band imaging allows us to isolate back¬ 
ground galaxies, and thus perform a robust WL analysis, 
removing contamination and dilution of the WL signal 
which is a dominant source of systematics in cluster WL 
analyses, particularly near cluster centers. Obtaining an 
undiluted mass map, we detect four distinct substruc¬ 
tures in the cluster central region with higher confidence 
than before, which we label Core, W, NE and NW, with 
significance of 7.9a^, 4.7<J/^ and respectively. 

We constrain the total mass of this massive cluster to 
be M 200 C = (2.06 ± 0.42) x 10^^ Mq. By simultaneously 
fitting an NEW halo and three tNFW halos to the four 
detected substructures, we constrain the substructures 
masses to be M 200 C = (0.77 ± 0.34) x 10^^ Mq for the 
southern Core, M 200 C = (0.45 ± 0.20) x 10^^ Mq for the 
W substructure, M 200 C = (0.28 ±0.16) x 10^^ Mq for the 
NE substructure and M 200 C = (0.19 ± 0.12) x 10^^ Mq 
for the NW substructure. 

Although on larger scales (> 1 Mpc) the gas, mass 
and galaxies seem to be distributed similarly, on smaller 
scales (< 200 kpc) there are significant inconsistencies 
between the positions of gas and the total mass, but 
also between the total mass and the galaxies. The mas¬ 
sive main core does appear close to the southern X-ray 
core, and close to the BCGs. Our second most massive 
peak (W), on the other hand, is offset 72 I 53 " east of the 
western BCG, toward the cluster core, and appears to 
be stripped of both gas and BCGs (though some lower 
brightness cluster members are detected), with M 2 d(< 
250kpc)/Li?c(< 250kpc) = (584±162)x Mq/Lq. Such 
an offset between DM and galaxies could imply that the 
DM is not as collisionless as the galaxies. However, given 
the large uncertainties on the substructure masses and 
their level of deficiency/offset relative to the galaxies, we 
cannot put a meaningful lower bound on the DM cross 
section for collision at this point. 

Based on the mass ratios we found, we argue that a 
major merger may have occurred along the west-east 
axis, with another south-north merger perturbing the 
main X-ray emission to the north. The X-ray interloper 
to the NW is harder to explain given our observations. 
We find it is unlikely to be the remnant of a slingshot 
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effect as suggested by Owers et al. (2011) and Mil, 
given the large distances we find between the different 
mass components. We offer an alternative explanation, 
where the NW clump is infalling into the cluster from the 
northwest, and is comprised of two subhalos that went 
through a separate third merger close to the LOS, caus¬ 
ing the gas to be stripped out and to appear as separate 
to both. To support our hypothesis, we provide actual 
Wbody/hydrodynamical simulations of binary mergers 
with mass ratios close to those derived from our WL re¬ 
construction that nicely reproduce many of the observed 
features. We used a superposition of independent binary 
mergers, yet it is clear that the multiple mergers have in¬ 
teracted with each other. Thus, one simulation of several 
mergers of more than two components is needed in order 
to fully explain the observed features of A2744, for which 
our analysis here may serve as a starting point (Molnar 
et ah, in prep.). 

Although consistent within 1 — 2cr with the subhalos 
masses and positions of Mil, we find some differences, 
most notably in the location of the W and NW sub¬ 
structures. We conclude based on the careful systemat- 
ics analysis in Section 4.3 and bootstrap tests, that the 
major differences between our WL analyses is mostly 
due to biased source selection. Another major differ¬ 
ence is not including SL constraints in our analysis as 
was done in Mil in the center of the field. Adding SL 
constraints, with their possibly underestimated errors, 
largely dominates over WL constraints, possibly giving 
more weight to the central mass core. Our WL analy¬ 
sis is improved compared to Mil’s wide-field WL anal¬ 
ysis in terms of seeing (FWHMi^^^ = 1.16" versus Mil’s 
FWHM^/ = 1.42"), depth {Rq < 26.83 versus Mil’s 
Rc < 26.31) and careful background selection. How¬ 
ever, the quality of our Subaru imaging here is still poor 
compared with the typical level achievable with Subaru, 
reaching FWHMi^^^ = 0.6" as in our previous WL studies 
(Medezinski et al. 2010; Umetsu et al. 2014). 

Including HST WL and SL analysis using the new, 
deep FF observations will further improve the constraints 
on the substructures masses and locations, which we plan 
to pursue in a subsequent paper (Merten et al. in prep). 
However, the current FOV of the HSTFF imaging covers 
the inner ^ 2' x 2', only probing the Core and part of the 
NW substructure region, and not fully covering the other 
satellite substructures. Furthermore, combining obser¬ 
vations of different instruments and depths (as Subaru 
and HST) may further lead to systematics, necessitating 
careful weighting of the different regions of interest. An 
optimal approach would be to tile over the inner < 5' 
with HST/ACS with several pointings for full coverage 
of the four substructures identified, which would provide 
an order of magnitude improvement in the spatial reso¬ 
lution of the WL mass map. 
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